AITopics | adjusted rand index

Toward Interpretable Evaluation Measures for Time Series Segmentation

Neural Information Processing Systems | Jun-14-2026, 13:26:35 GMT

Time series segmentation is a fundamental task in analyzing temporal data across various domains, from human activity recognition to energy monitoring. While numerous state-of-the-art methods have been developed to tackle this problem, the evaluation of their performance remains critically limited. Existing measures predominantly focus on change point accuracy or rely on point-based measures such as the Adjusted Rand Index (ARI), which fail to capture the quality of the detected segments, ignore the nature of errors, and offer limited interpretability. In this paper, we address these shortcomings by introducing two novel evaluation measures: WARI (Weighted Adjusted Rand Index), which accounts for the position of segmentation errors, and SMS (State Matching Score), a fine-grained measure that identifies and scores four fundamental types of segmentation errors while allowing error-specific weighting. We empirically validate WARI and SMS on synthetic and real-world benchmarks, showing that they not only provide a more accurate assessment of segmentation quality but also uncover insights, such as error provenance and type, that are inaccessible with traditional measures.
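
The abstract's claim that a point-based measure such as ARI ignores the nature of errors can be illustrated with a minimal sketch. The code below is a plain-Python version of the standard pair-counting ARI formula, not the authors' code, and it does not implement WARI or SMS. The toy label sequences (`truth`, `shifted_boundary`, `spurious_segment`) are hypothetical: one prediction delays the change point by two samples, the other places a two-sample spurious segment in the middle of a state. Both produce the same contingency table, so ARI cannot tell them apart.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Pair-counting Adjusted Rand Index between two labelings of the same points."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("both labelings must cover the same points")
    total_pairs = comb(len(labels_true), 2)
    if total_pairs == 0:
        return 1.0  # zero or one point: the labelings trivially agree

    # Contingency counts and their marginals.
    cells = Counter(zip(labels_true, labels_pred))
    rows = Counter(labels_true)
    cols = Counter(labels_pred)

    sum_cells = sum(comb(c, 2) for c in cells.values())
    sum_rows = sum(comb(c, 2) for c in rows.values())
    sum_cols = sum(comb(c, 2) for c in cols.values())

    expected = sum_rows * sum_cols / total_pairs
    maximum = (sum_rows + sum_cols) / 2
    if maximum == expected:
        return 1.0  # degenerate case, e.g. a single cluster on both sides
    return (sum_cells - expected) / (maximum - expected)


# Hypothetical toy series: two states of ten samples each.
truth = [0] * 10 + [1] * 10
# Error type A: the change point is detected two samples late.
shifted_boundary = [0] * 12 + [1] * 8
# Error type B: a spurious two-sample segment inside the second state.
spurious_segment = [0] * 10 + [1] * 3 + [0] * 2 + [1] * 5

print(adjusted_rand_index(truth, shifted_boundary))
print(adjusted_rand_index(truth, spurious_segment))
```

Both calls print the same value because ARI only sees how many points fall in each (true state, predicted state) cell, not where in the series they occur.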

data mining, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > United States (1.00)
Europe (0.93)
Asia (0.67)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Natural Language (0.69)
(4 more...)

11e3e0f1b29dcd31bd0952bfc1357f68-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems | Apr-25-2026, 01:15:05 GMT

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

Asia (1.00)
North America > United States > Minnesota (0.28)
North America > Canada > Ontario > Toronto (0.14)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine (0.68)
Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
(4 more...)

206018a258033def63607fbdf364bd2d-Paper-Conference.pdf

Neural Information Processing Systems | Feb-9-2026, 07:27:18 GMT

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Singapore (0.04)
Asia > Indonesia > Bali (0.04)
(15 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)

735ddec196a9ca5745c05bec0eaa4bf9-Supplemental.pdf

Neural Information Processing Systems | Feb-8-2026, 22:29:24 GMT

mice, protein, section 5, (16 more...)

Neural Information Processing Systems

Country: North America > Canada (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.51)

Appendices: A Additional Experiments

Neural Information Processing Systems | Feb-8-2026, 00:49:35 GMT

Table 6: Results of selected models on Task 1 (Grouping) using contextual embeddings. In this section, we provide additional t-SNE projections of embeddings from various methods used. Figure 7: Solved wall for Task 1 (Grouping) using GloVe. Left: ("Suspension" is "a term used in musical harmony" in this context. Grief" in the embedding space, which matches the "Good ___!" connection. Figure 8: Solved wall for Task 1 (Grouping) using FastText (Crawl). Left: contextual embedding solved 3/4 groups. Here the clue "Rembrandt" is placed near other Dutch painters. Right: static embedding solved 0/4 groups. The following section provides answers to questions listed in datasheets for datasets. For what purpose was the dataset created? Was there a specific task in mind? Who created this dataset (e.g., which team, research group) and on behalf of which entity (e.g., The dataset has been collectively curated by the authors of this paper. What support was needed to make this dataset?

artificial intelligence, large language model, natural language, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (0.48)

Industry: Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.48)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.31)

Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset

Neural Information Processing Systems | Feb-8-2026, 00:49:32 GMT

While recent NLP evaluation benchmark tasks test some aspects of human-imitative behavior (e.g., BIG-bench's 'human-like behavior' tasks), few, if any, examine creative problem-solving abilities.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Maryland > Baltimore (0.04)
(13 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine (0.68)
Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

A Hybrid Computational Intelligence Framework for scRNA-seq Imputation: Integrating scRecover and Random Forests

Anaissi, Ali, Liu, Deshao, Jia, Yuanzhe, Huang, Weidong, Alyassine, Widad, Akram, Junaid

arXiv.org Artificial Intelligence | Nov-24-2025

Single-cell RNA sequencing (scRNA-seq) enables transcriptomic profiling at cellular resolution but suffers from pervasive dropout events that obscure biological signals. We present SCR-MF, a modular two-stage workflow that combines principled dropout detection using scRecover with robust non-parametric imputation via missForest. Across public and simulated datasets, SCR-MF achieves robust and interpretable performance comparable to or exceeding existing imputation methods in most cases, while preserving biological fidelity and transparency. Runtime analysis demonstrates that SCR-MF provides a competitive balance between accuracy and computational efficiency, making it suitable for mid-scale single-cell datasets.

artificial intelligence, imputation, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2511.16923

Country:

Asia (0.28)
Oceania > Australia (0.15)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Oh That Looks Familiar: A Novel Similarity Measure for Spreadsheet Template Discovery

Krishnakumar, Anand, Ravikumaran, Vengadesh

arXiv.org Artificial Intelligence | Nov-12-2025

Traditional methods for identifying structurally similar spreadsheets fail to capture the spatial layouts and type patterns defining templates. To quantify spreadsheet similarity, we introduce a hybrid distance metric that combines semantic embeddings, data type information, and spatial positioning. In order to calculate spreadsheet similarity, our method converts spreadsheets into cell-level embeddings and then uses aggregation techniques like Chamfer and Hausdorff distances. Experiments across template families demonstrate superior unsupervised clustering performance compared to the graph-based Mondrian baseline, achieving perfect template reconstruction (Adjusted Rand Index of 1.00 versus 0.90) on the FUSTE dataset. Our approach facilitates large-scale automated template discovery, which in turn enables downstream applications such as retrieval-augmented generation over tabular collections, model training, and bulk data cleaning.
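
The Chamfer and Hausdorff distances named in the abstract are both ways of aggregating nearest-neighbour distances between two sets of vectors. The sketch below shows one common definition of each in plain Python. It is an illustration under stated assumptions, not the authors' method: the function names are hypothetical, the 2-D tuples stand in for the paper's cell-level embeddings, and Chamfer distance is taken here as the sum of the two directed mean nearest-neighbour distances (other variants use squared distances or sums instead of means).

```python
from math import dist


def _nearest(point, others):
    """Euclidean distance from `point` to its closest neighbour in `others`."""
    return min(dist(point, other) for other in others)


def chamfer_distance(a, b):
    """Sum of the two directed mean nearest-neighbour distances (assumed variant)."""
    a_to_b = sum(_nearest(p, b) for p in a) / len(a)
    b_to_a = sum(_nearest(q, a) for q in b) / len(b)
    return a_to_b + b_to_a


def hausdorff_distance(a, b):
    """Largest nearest-neighbour distance in either direction."""
    a_to_b = max(_nearest(p, b) for p in a)
    b_to_a = max(_nearest(q, a) for q in b)
    return max(a_to_b, b_to_a)


# Hypothetical stand-ins for cell embeddings of two spreadsheets:
# sheet_b has the same two cells as sheet_a plus one extra, distant cell.
sheet_a = [(0.0, 0.0), (1.0, 0.0)]
sheet_b = [(0.0, 0.0), (1.0, 0.0), (5.0, 0.0)]

print(chamfer_distance(sheet_a, sheet_b))
print(hausdorff_distance(sheet_a, sheet_b))
```

Chamfer averages over all cells, so a single unmatched cell is diluted by the matching ones, whereas Hausdorff is driven entirely by the worst-matched cell.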

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2511.06973

Genre: Research Report (0.52)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Toward Interpretable Evaluation Measures for Time Series Segmentation

Chavelli, Félix, Boniol, Paul, Thomazo, Michaël

arXiv.org Artificial Intelligence | Oct-28-2025

Time series segmentation is a fundamental task in analyzing temporal data across various domains, from human activity recognition to energy monitoring. While numerous state-of-the-art methods have been developed to tackle this problem, the evaluation of their performance remains critically limited. Existing measures predominantly focus on change point accuracy or rely on point-based measures such as the Adjusted Rand Index (ARI), which fail to capture the quality of the detected segments, ignore the nature of errors, and offer limited interpretability. In this paper, we address these shortcomings by introducing two novel evaluation measures: WARI (Weighted Adjusted Rand Index), which accounts for the position of segmentation errors, and SMS (State Matching Score), a fine-grained measure that identifies and scores four fundamental types of segmentation errors while allowing error-specific weighting. We empirically validate WARI and SMS on synthetic and real-world benchmarks, showing that they not only provide a more accurate assessment of segmentation quality but also uncover insights, such as error provenance and type, that are inaccessible with traditional measures.

data mining, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2510.23261

Country: